Semantic Variation in Idiolect and Sociolect: Corpus Linguistic Evidence from Literary Texts

نویسنده

  • Max M. Louwerse
چکیده

Idiolects are person-dependent similarities in language use. They imply that texts by one author show more similarities in language use than texts between authors. Sociolects, on the other hand, are group-dependent similarities in language use. They imply that texts by a group of authors, for instance in terms of gender or time period, share more similarities within a group than between groups. Although idiolects and sociolects are commonly used terms in the humanities, they have not been investigated a great deal from corpus and computational linguistic points of view. To test several idiolect and sociolect hypotheses a factorial combination was used of time period (Modernism, Realism), gender of author (male, female) and author (Eliot, Dickens, Woolf, Joyce) totaling 16 corresponding literary texts. In a series of corpus linguistic studies using Boolean and vector models, no conclusive evidence was found for the selected idiolect and sociolect hypotheses. In final analyses testing the semantics within each literary text, this lack of evidence was explained by the low homogeneity within a literary text.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A corpus stylistic approach to Shakespearian soliloquies

A popular interest in Shakespeare has been matched in recent years by an increasing number of computer-assisted analyses of the plays. Although not without their critics, corpus stylistic studies have offered scope and reliability in the study of literary texts, particularly through key word analyses. In this paper, I show how Wmatrix, a web-based corpus processing environment (Rayson, 2003, 20...

متن کامل

Stylistics: Corpus Approaches

Stylistics, which may be defined as the study of the language of literature, makes use of various tools of linguistic analysis. Corpus linguistics is opening up new vistas for the study of language, and there are interesting similarities in the approaches of stylistics and corpus linguistics. Stylistics is a field of empirical inquiry, in which the insights and techniques of linguistic theory a...

متن کامل

The evolution of the meaning of the word nurse based on the classical texts of Persian literature

Background and Aim: The semantic evolution of a word over time is inevitable, indicating a social, political, religious or cultural process. Nurse is one of the words that has a significant presence in Persian literature texts and has been used in many different meanings such as slave, servan, maid, devotee, obedient, patient and preserver. The purpose of this study is to show its semantic ev...

متن کامل

Introducing the Austrian Baroque Corpus: Annotation and Application of a Thematic Research Collection

This paper gives an overview of a relatively new thematic corpus based on German sacred literature of the Baroque period. At present, the digital collection consists of several texts specific to the memento mori genre. All texts in the Austrian Baroque Corpus (ABaC:us) have been enriched with different layers of structural information and tagged using automated tools adapted to the specific nee...

متن کامل

Validation d'une méthodologie pour l'étude des marqueurs de la segmentation dans un grand corpus de textes

This research aims at validating a methodology for the study of segmentation markers in large corpora. Two indices signalling a thematic break in a text are proposed. The first is based on the presence of a paragraph mark and employs the odds ratio to identify the best markers. The second takes into account lexical cohesion between sentences via an index resulting from latent semantic analysis....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computers and the Humanities

دوره 38  شماره 

صفحات  -

تاریخ انتشار 2004